Multi-Model and Crosslingual Dependency Analysis

Authors

Johannes Heinecke

Munshi Asadullah

Abstract

This paper describes the system of team Orange-Deskiñ for the CoNLL 2017 UD Shared Task. We based our approach on an existing open-source tool (BistParser), which we modified to produce the required output. We also added a form of pseudo-projectivisation, which was needed because some of the task's languages have a high percentage of non-projective dependency trees. In most cases we also employed word embeddings. For the four surprise languages, the data provided seemed too small to train on, so we used the training data of typologically close languages instead. Our system achieved a macro-averaged LAS of 68.61% (10th in the overall ranking), which improved to 69.38% after bug fixes.
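To make the notion of a non-projective dependency tree concrete, the following is a minimal sketch of a projectivity check. It is an illustration only and is not taken from the Orange-Deskiñ system or from BistParser; the function names `is_projective` and `non_projective_fraction`, and the head-list encoding (one head index per token, 0 for the artificial root), are assumptions made for this example.

```python
from typing import List


def is_projective(heads: List[int]) -> bool:
    """Return True if the dependency tree has no crossing arcs.

    Assumed encoding: heads[i] is the 1-based index of the head of
    token i + 1, and 0 denotes the artificial root placed before the
    first token.
    """
    # Represent every arc by its left and right end positions.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a_lo, a_hi in arcs:
        for b_lo, b_hi in arcs:
            # Arc b starts strictly inside arc a and ends strictly outside it.
            if a_lo < b_lo < a_hi < b_hi:
                return False
    return True


def non_projective_fraction(trees: List[List[int]]) -> float:
    """Fraction of trees in a treebank sample that are non-projective."""
    if not trees:
        return 0.0
    return sum(not is_projective(t) for t in trees) / len(trees)


if __name__ == "__main__":
    projective = [2, 0, 2]         # token 2 is the root; no arcs cross
    crossing = [3, 4, 0, 3]        # arcs 3->1 and 4->2 cross
    print(is_projective(projective))
    print(is_projective(crossing))
    print(non_projective_fraction([projective, crossing]))
```

A count of this kind is one way to see which treebanks have enough non-projective trees for a projectivising transformation to matter; the quadratic arc comparison is chosen for clarity rather than speed.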

Similar resources

Parsing Natural Language Sentences by Semi-supervised Methods

We present our work on semi-supervised parsing of natural language sentences, focusing on multi-source crosslingual transfer of delexicalized dependency parsers. We first evaluate the influence of treebank annotation styles on parsing performance, focusing on adposition attachment style. Then, we present KLcpos3, an empirical language similarity measure, designed and tuned for source parser we...

Exploring Cross-Lingual Transfer of Morphological Knowledge In Sequence-to-Sequence Models

Multi-task training is an effective method to mitigate the data sparsity problem. It has recently been applied for crosslingual transfer learning for paradigm completion (the task of producing inflected forms of lemmata) with sequence-to-sequence networks. However, it is still vague how the model transfers knowledge across languages, as well as if and which information is shared. To investigate th...

Power System Analysis for Nonsinusoidal Steady State Studies Based on Wavelets

In this paper, the power system model is represented in a new domain related to Multi-Resolution Analysis (MRA) space. By developing a mathematical model of the elements in this space using the Galerkin method, a new alternative method for power system simulation under nonsinusoidal and periodic conditions is developed. The mathematical formulation and characteristics of the newly proposed space are expressed. Als...

Inverted indexing for cross-lingual NLP

We present a novel, count-based approach to obtaining inter-lingual word representations based on inverted indexing of Wikipedia. We present experiments applying these representations to 17 datasets in document classification, POS tagging, dependency parsing, and word alignment. Our approach has the advantage that it is simple, computationally efficient and almost parameter-free, and, more impo...

Cross-Lingual Syntactically Informed Distributed Word Representations

We develop a novel cross-lingual word representation model which injects syntactic information through dependency-based contexts into a shared cross-lingual word vector space. The model, termed CLDEPEMB, is based on the following assumptions: (1) dependency relations are largely language-independent, at least for related languages and prominent dependency links such as direct objects, as evidenc...

Publication date: 2017
